Combining PCFG-LA Models with Dual Decomposition: A Case Study with Function Labels and Binarization

نویسندگان

  • Joseph Le Roux
  • Antoine Rozenknop
  • Jennifer Foster
چکیده

It has recently been shown that different NLP models can be effectively combined using dual decomposition. In this paper we demonstrate that PCFG-LA parsing models are suitable for combination in this way. We experiment with the different models which result from alternative methods of extracting a grammar from a treebank (retaining or discarding function labels, left binarization versus right binarization) and achieve a labeled Parseval F-score of 92.4 on Wall Street Journal Section 23 – this represents an absolute improvement of 0.7 and an error reduction rate of 7% over a strong PCFG-LA product-model baseline. Although we experiment only with binarization and function labels in this study, there is much scope for applying this approach to other grammar extraction strategies.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Terminology of Combining the Sentences of Farsi Language with the Viterbi Algorithm and BI-GRAM Labeling

This paper, based on the Viterbi algorithm, selects the most likely combination of different wording from a variety of scenarios. In this regard, the Bi-gram and Unigram tags of each word, based on the letters forming the words, as well as the bigram and unigram labels After the breakdown into the composition or moment of transition from the decomposition to the combination obtained from th...

متن کامل

Evaluation of Price Setting Models in Iran’s Economy (DSGE Approach)

 Despite the consensus on the importance of nominal rigidities, there is no general agreement among monetary economists regarding the most appropriate and consistent pricing model that must be used to assess the effects of monetary policies in the economy. Due to the lack of empirical evidence with relation to the pricing behavior of Iranian firms, there is no general agreement on how to introd...

متن کامل

POINTWISE CONVERGENCE TOPOLOGY AND FUNCTION SPACES IN FUZZY ANALYSIS

We study the space of all continuous fuzzy-valued functions  from a space $X$ into the space of fuzzy numbers $(mathbb{E}sp{1},dsb{infty})$  endowed with the pointwise convergence topology.   Our results generalize the classical ones for  continuous real-valued functions.   The field of applications of this approach seems to be large, since the classical case  allows many known devices to be fi...

متن کامل

Studying impressive parameters on the performance of Persian probabilistic context free grammar parser

In linguistics, a tree bank is a parsed text corpus that annotates syntactic or semantic sentence structure. The exploitation of tree bank data has been important ever since the first large-scale tree bank, The Penn Treebank, was published. However, although originating in computational linguistics, the value of tree bank is becoming more widely appreciated in linguistics research as a whole. F...

متن کامل

Appropriately Handled Prosodic Breaks Help PCFG Parsing

This paper investigates using prosodic information in the form of ToBI break indexes for parsing spontaneous speech. We revisit two previously studied approaches, one that hurt parsing performance and one that achieved minor improvements, and propose a new method that aims to better integrate prosodic breaks into parsing. Although these approaches can improve the performance of basic probabilis...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013